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RANDOM MATRIX TECHNIQUES IN QUANTUM INFORMATION THEORY 


BENOiT COLLINS AND ION NECHITA 


Abstract. The purpose of this review article is to present some of the latest developments using random 
techniques, and in particular, random matrix techniques in quantum information theory. Our review is 
a blend of a rather exhaustive review, combined with more detailed examples - coming from research 
projects in which the authors were involved. We focus on two main topics, random quantum states and 
random quantum channels. We present results related to entropic quantities, entanglement of typical 
states, entanglement thresholds, the output set of quantum channels, and violations of the minimum 
output entropy of random channels. 


Contents 


1. Introduction 2 

2. Background on quantum information: quantum states and channels 3 

2.1. Quantum states 3 

2.2. Entropies 3 

2.3. Quantum Channels 4 

2.4. Graphical notation for tensors 4 

3. Background on random matrix theory and free probability 5 

3.1. Gaussian random variables 5 

3.2. Unitary integration. Weingarten calculus 6 

3.3. Graphical interpretation of Wick and Weingarten calculus 8 

3.4. Some elements of free probability theory 8 

4. Entanglement of random quantum states 10 

4.1. Probability distributions on the set of quantum states 10 

4.2. Moments. Average entropy 14 

4.3. Entanglement 15 

4.4. Entanglement criteria 16 

4.5. Absolute separability 19 

5. Deterministic input through random quantum channels 20 

6. Random quantum channels and their output sets 21 

6.1. Early results on random unitary channels 21 

6.2. Results with a fixed output space 21 

6.3. More results about the output of random channels 22 

7. The additivity problem for tensor products of random quantum channels 22 

7.1. The classical capacity of quantum channels and the additivity question 22 

7.2. Conjugate quantum channels and the MOE of their tensor product 23 

7.3. Early results in relation to the violation of MOE, history and the state of the art 24 

8. Other applications of RMT to quantum spin chains volume laws 25 

8.1. Maximum entropy principle for random matrix product states 25 

8.2. Multiplicative bounds for random quantum channels 26 

8.3. Sum of random projections on tensor products 27 

8.4. Area laws for random quantum states associated to graphs 27 

9. Conclusions and open questions 28 

References 29 


Date: September 16, 2015. 


1 



2 


BENOIT COLLINS AND ION NECHITA 


1. Introduction 

Quantum computing was initiated in 1981 by Richard Feynman at a conference on physics and com¬ 
putation at the MIT, where he asked whether one can simulate physics on a computer. As for quantum 
information theory, it is somehow the backbone of quantum computing, although it emerged indepen¬ 
dently in the 60s and 70s (with, among others. Bell inequalities). In the last 20-30 years, it has witnessed 
a very fast development, and it is now a major scientific field of its own. 

In classical information theory, probabilistic methods have been at the heart of the theory from its 
inception [Sha48]. In contrast, probabilistic methods have arguably played a less important role in the 
infancy of quantum information theory - although probability theory itself was cast at the heart of the 
postulates of quantum mechanics (Bell inequalities themselves are a probabilistic statement). However, 
the situation has dramatically changed in the last 10-15 years, and probabilistic techniques have nowadays 
proven to be very useful in quantum information theory. Quite often, these probabilistic techniques are 
very closely related to problems in random matrix theory. 

Let us explain heuristically why random matrix theory is a natural tool for quantum information 
theory, by comparing with the use of elementary probability in the concept of classical information. In 
classical information, the first - and arguably one of the greatest - success of the theory was to compute 
the relative volume of a typical set with Shannon’s entropy function. Here, the main probabilistic tool 
was the central limit theorem (more precisely, an exponential version thereof). The central limit theorem 
is a tool that is very well adapted to the study of product measures on the product of (finite) sets. In 
quantum information, sets and probability measures on theses sets, as well as measurements, are all 
replaced by matrices, and their non-commutative structure is central to quantum information theory. In 
addition, the use of ‘probabilistic techniques’ in classical information theory is not a goal per se. It is a 
convenient mathematical tool to prove existence theorems, for which non-random proofs are much more 
difficult to achieve. Incidentally, it is rather natural to wonder what a ‘typical’ set looks like. 

The situation is quite similar in quantum information theory: although there was no a priori need for 
random techniques, some problems - in particular the minimum output entropy additivity problem, which 
we discuss at length in this review ~ did not have an obvious non-random answer, therefore it became not 
only natural, but also important, to consider how ‘typical’ quantum objects behave. This was arguably 
initiated by the paper [HLSW04]. The results obtained in the first papers were of striking importance in 
quantum information theory, and they pointed at the fact that well established mathematical tools could 
be expected to be very useful tools to solve problems in quantum information theory. All these problems 
were naturally linked with probability measures on matrix spaces. The first tool that was recognized to 
play an important role was concentration of measure. But more generally, all techniques were connected 
to the theory of random matrices. Random matrix theory relies on a wide range of technical tools, and 
concentration of measure is one of them among others. 

The first results using random techniques in quantum information theory attracted a few mathe¬ 
maticians ~ including the authors of this review, see also the bibliography - who undertook to apply 
systematically the state of the art in random matrix theory in order to study questions in quantum 
information theory. 

Random matrix theory itself has a long history. Although it is considered as a field of mathematics 
(or mathematical physics), it was not born in mathematics, but rather in statistics and physics. Wishart 
introduced the distribution that bears his name in the 1920’s, in order to explain the discrepancy between 
the eigenvalues of a measured covariance matrix, and an expected covariance matrix. And Wigner had 
motivations from quantum physics when he introduced the semi-circle distribution. Since then, random 
matrix theory has played a role in many fields of mathematics and science, including: 

• Theoretical physics [Wit86, Kon94, Meh04] 

• Combinatorics and algebraic geometry [HZ86, Oko04] 

• Integrable systems and PDF [TW02] 

• Complex analysis and Riemann-Hilbert problems [KMVAV04] 

• Operator algebras [VDN92] 

• Telecommunication [TV04] 
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• Finance [BP03] 

• Number theory [KS99] 

The above list does certainly not exhaust the list of fields of application of random matrix theory, but 
quantum information theory is definitely one of the most recent of them (and a very natural one, too!). 

Our goal in this paper is to provide an overview of a few important uses that random matrix theory 
had in quantum information theory. Instead of being exhaustive, we chose to pick a few topics that look 
important to us, and hopefully emblematic of the roles that random matrices could play in the future in 
QIT. Obviously, our choices are biased by our own experience. 

This paper is organized as follows Section 2 provides some mathematical notation for quantum infor¬ 
mation. It is followed by Section 3, that supplies background for random matrix and free probability 
theory. The remaining sections are a selection of applications of random matrix theory to quantum in¬ 
formation, namely: 4: Entanglement of random quantum states, 5: properties of output of deterministic 
states (of interest) under quantum channels, 6 : study of all outputs under specific random quantum 
channels, 7: the solution to the MOE additivity problems, and finally 8 : other applications of RMT in 
quantum physics, and 9: a selection of open questions. 


2. Background on quantum information: quantum states and channels 


2.1. Quantum states. We denote the set of d-dimensional mixed quantum states (or density matrices) 
by Vd,: 

Vd := {p G (C) : p > 0 and TV(p) = 1}. (2.1) 

The collection of density matrices Dd is naturally associated to the Hilbert space C'^. One of the 
fundamental postulates of quantum mechanics is that two disjoint systems can be studied together by 
taking their Hilbert tensor product. Eor example, a system with state space and another one with 
state space are studied together with the state space ® . In particular, the density matrices 

have the structure of 'Dd^d 2 - Of particular interest is the subset 

•S£Vdud2 ■= conv(T>rf^ (8) T’da) ^ ^did2- 


This convex compact subset is interpreted as the collection of all ‘classical’ density matrices on the 
bipartite state. Unless di = 1 or ^2 = 1 it is a strict subset of 'Ddid 2 - It is called the collection of separable 
states. States which are not separable are called entangled. The study of entangled states is one of the 
cornerstones of quantum information theory. As a first (and paramount) example of entangled state, 
consider the qubit Hilbert space endowed with an orthonormal basis { 61 , 62 }, and the state 


9 H2 — ^2^2) 


where 002 


1 

Vi 


(61 (g) 61 -I- 62 ® 62), 


called the maximally entangled qubit state. More generally, the maximally entangled state of two qudits 
is 


£^d‘2 3 — ^d^*d: 


where 



d 

^6i (g) 6j. 
i=l 


( 2 . 2 ) 


2.2. Entropies. As in classical information theory [Sha48], entropic quantities play a very important 
role in quantum information theory. We define next the quantities of interest for the current work. Let 
Afc = {x G I ~ 1} be the {k — l)-dimensional probability simplex. For a positive real number 

p > 0, define the Renyi entropy of order p of a probability vector x G to be 

k 

i=l 

Since limp_^i H^{x) exists, we define the Shannon entropy of x to be this limit, namely: 

k 

H{x) = H^{x) = - log Xj. 

i=l 
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We extend these definitions to density matrices by functional calculus: 

H'P{p) = -^logTr/; 
l-p 

H{p) = H^{p) = -Tr/rlogp. 


2.3. Quantum Channels. In Quantum Information Theory, a quantum channel is the most general 
transformation of a quantum system. Quantum channels generalize the unitary evolution of isolated 
quantum systems to open quantum systems. Mathematically, we recall that a quantum channel is a linear 
completely positive trace preserving map from Aln(C) to itself. The trace preservation condition 
is necessary since quantum channels should map density matrices to density matrices. The complete 
positivity condition can be stated as 

Vd > 1, (g) Irf ; M-nd{C) —)■ M-nd{C) is a positive map. 

The following two characterizations of quantum channels turn out to be very useful. 


Proposition 2.1. A linear map <I> : Aln(C) — )• Aln(C) is a quantum channel if and only if one of the 
following two equivalent conditions holds. 

(1) (Stinespring dilation) There exists a finite dimensional Hilbert space K, = a density matrix 
Y G A4d{C) and an unitary operator U G U{nd) such that 

^X) = Ti^[U{X®Y)U*], VXGMn(C). (2.3) 

(2) (Kraus decomposition) There exists an integer k and matrices Li,... ,Lk G Aln(C) such that 

k 

m) = E LiXL*, VXGMn(C). 
i=l 

and 

k 

i=l 

(3) (Choi matrix) The following matrix, called the Choi matrix of <I> 

n 

M„2(C) 3 C^ = [id ® $](0d) = ^ Eij ® (2.4) 

is positive-semidefinite. 


It can be shown that the dimension of the ancilla space /C in the Stinespring dilation theorem can be 
chosen d = dim/C = and that the state Y can always be considered to be a rank one projector. A 
similar result holds for the number of Kraus operators: one can always find a decomposition with k = 
operators. 

Going back to the entropic quantities, of special interest for the computation of capacities of quantum 
channels to transmit classical information are the following expressions, indexed by some positive real 
parameter p 

= min HP{^{p)). 
pex>n 


2.4. Graphical notation for tensors. Quantum states, tensors, and operations between these objects 
(composition, tensor product, applying a state through a quantum channel, etc) can be efficiently rep¬ 
resented graphically. The leading idea is that a string in a diagram means a tensor contraction. Many 
graphical theories for tensors and linear algebra computations have been developed in the literature 
[Pen05, CoelO]. Although they are all more or less equivalent, we will stick to the one introduced in 
[CNlOb], as it allows to compute the expectation of random diagrams in a diagrammatic way subse¬ 
quently. For more details on this method, we refer the reader to the paper [CNlOb] and to other work 
which make use of this technique [CNlla, CNlOa, CNZIO, CNZ13, FS13, CGGPG13, Lanl5] 
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Figure 1. Some simple diagrams 



In the graphical calculus matrices (or, more generally, tensors) are represented by boxes. Each box has 
differently shaped symbols, where the number of different types of them equals that of different spaces 
(exceptions are mentioned bellow). Those symbols are empty (while) or filled (black), corresponding to 
primal or dual spaces. Wires connect these symbols, corresponding to tensor contractions. A diagram 
is a collection of such boxes and wires and corresponds to an element of an abstract element in a tensor 
product space. 

Rather than going through the whole theory, we focus on a few key examples. 

Suppose that each diagram in Figure 1 comes equipped with two vector spaces Vi and V 2 which we 
shall represent respectively by circle and square shaped symbols. In the first diagram, M is a tensor (or 
a matrix, depending on which point of view we adopt) M G Fj* ( 8 > Fi, and the wire applies the contraction 
F]* (8) Fi —>■ C to M. The result of the diagram Va is thus = Tr(M) G C. In the second diagram, 
again there are no free decorations, hence the result is the complex number = {y,Mx). Finally, in 
the third example, is a (2, 2) tensor or a linear map N G End(Fi (8 F 2 , Fi ( 8 ) F 2 ). When one applies to 
the tensor N the contraction of the couple (Fi, Fj*), the result is the partial trace of N over the space Fi: 
Tp^ = Try,(Ar) GEnd(F2,F2). 

3. Background on random matrix theory and free probability 
3.1. Gaussian random variables. The probability density of the normal distribution is: 

f{x \ tJ.,cr) = — = e 2 -^ 
a\/27r 

Here, /i is the mean. The parameter a is its standard deviation with its variance then fj^. A random 
variable with a Gaussian distribution is said to be normally distributed 

Suppose X and Y are random vectors in such that {X, Y) is a 2A:-dimensional normal random 
vector. Then we say that the complex random vector Z = X + iY has the complex normal distribution. 

Historically the first ensemble of random matrices having been studied is the Wishart ensemble [Wis28], 
see [BSIO, Chapter 3] or [AGZIO, Section 2.1] for a modern presentation. 

Definition 3.1. Let X G Aldxs(C) be a random matrix with complex, standard, i.i.d. Gaussian entries. 
The distribution of the positive-semidefinite matrix W = XX* G Ald(C) is called a Wishart distribution 
of parameters {d, s) and is denoted by Wd^s- 

The study of the asymptotic behavior of Wishart random matrices is due to Marcenko and Pastur 
[MP67], while the strong convergence in the theorem below has has been proved by analytic tools such 
as determinantal point processes. Let us also record it as a direct consequence of the much more general 
results [MaH2]. 

Theorem 3.2. Consider a sequence Sd of positive integers which behaves as Sd ^ cd as d ^ 00 , for some 
eonstant c G (0, 00). Let Wd be a sequenee of positive-semidefinite random matrices such that Wd ~ Y2d,s,i- 
Then, the sequence Wd eonverges strongly to the Marcenko-Pastur distribution tTc given by 

TTc = max(l - c,0)(5o + . . . . ~ l(^a,b)ix) dx, (3.1) 

where a = (1 — ^/c)‘^ and 6 = (1 + ^/c)‘^. 

The Marcenko-Pastur distribution ttc is sometimes called the free Poisson distribution. We plotted in 
Figure 2 its density in the cases c = 1 and c = 4. 
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density 




Figure 2. The density of the Marcenko-Pastur distributions tti (left) and 714 (right). 


The following theorem is the link between combinatorics and probability theory for Gaussian vectors: 
it allows to compute moments of any Gaussian vector thanks to its covariance matrix. 

A Gaussian space F is a real vector space of random variables having moments of all orders, with 
the property that each of these random variables has centered Gaussian distributions. In order to spec¬ 
ify the covariance information, such a Gaussian space comes with a positive symmetric bilinear form 
{x,y) —)• E[xy]. Gaussian spaces are in one-to-one correspondence with euclidean spaces. In particular, 
the euclidean norm of a random variable determines it fully (via its variance) and if two random variables 
are given, their joint distribution is determined by their angle. The following is usually called the Wick 
Lemma: 

Theorem 3.3. Let V be a Gaussian spaee and xi,...,Xk be elements in V. If k = 21 1 then 

E[xi • • • Xfc] = 0 and if k = 21 then 


i 

E[xi ■■■Xk]= (3.2) 

pairing 

In particular it follows that if xi,... ,Xp are independent standard Gaussian random variables, then 

E[x^...xJ1 = [](2fci)!! . 

i=l 

3 . 2 . Unitary integration. Weingarten calculns. This section contains some basic material on uni¬ 
tary integration and Weingarten calculus. A more complete exposition of these matters can be found in 
[G 0 IO 3 , CS06]. We start with the definition of the Weingarten function. 

Definition 3.4. The unitary Weingarten function Wg(n,cr) is a function of a dimension parameter n 
and of a permutation a in the symmetric group §p. It is the inverse of the function a 1 —?• under the 

convolution for the symmetric group (ffa denotes the number of cycles of the permutation a). 

Note that the function a 1 —?• is invertible when n is large, as it behaves like as n —)• 00 . If 
n < p the function is not invertible any more. For the definition to make sense, one needs to take the 
pseudo inverse (we refer to [GS06] for historical references and further details). We use the shorthand 
notation Wg(cj) = Wg(n, a) when the dimension parameter n is clear from context. 

The function Wg is used to compute integrals with respect to the Haar measure on the unitary group 
(we shall denote by U{n) the unitary group acting on an n-dimensional Hilbert space). The first theorem 
is as follows: 
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Theorem 3.5. Let n he a positive integer and i = {ii,... ,ip), i' = {ii,...,ip), j = {ji, ■ ■ ■ ,jp), = 

(ji) • • • jJp) be p-tuples of positive integers from {1,2,... ,n}. Then 



U, 






Uj! r dU = 

•^pjp 


If p ^ p' then 



o',Te§p 


UipjpUi[j[ 


• Ui' dU = 0. 

pfdpf 


(3.4) 


Since we perform integration over large unitary groups, we are interested in the values of the Wein- 
garten function in the limit n —)> oo. The following result encloses all the information we need for our 
computations about the asymptotics of the Wg function; see [Col03] for a proof. 


Theorem 3.6. For a permutation a G Sp, let Cycles{a) denote the set of cycles of a. Then 


Wg{n,a) = 

(_l)n-#. Yl Wg(n,c)(l + 0(n-2)) 

cG Cycles((7) 

(3.5) 

Wg(n, (1,.. 

.,(!)) = {-if -^Cd-i n {n-j)~^ 

(3.6) 


-d+l<j<d-l 


where Ci = is the i-th Catalan number. 

As a shorthand for the quantities in Theorem 3.6, we introduce the function Mob on the symmetric 
group. Mob is invariant under conjugation and multiplicative over the cycles; further, it satisfies for any 
permutation a G §p: 

Wg(n, cj) = n“^^“''l'^l^(Mob((T) + 0(n“^)) (3.7) 

where |it| = p — ffcr is the length of a, i.e. the minimal number of transpositions that multiply to cr. We 
refer to [CS06] for details about the function Mob. We finish this section by a well known lemma which 
we will use several times towards the end of the paper. This result is contained in [NS06]. 

Lemma 3.7. The function d{a,T) = Icr^^rl is an integer valued distance on §p. Besides, it has the 
following properties: 

• the diameter of §p is p — 1; 

• d{-,-) is left and right translation invariant; 

• for three permutations cji,(T 2 ,t G §p, the quantity d(r,cji) + d(r,CJ 2 ) has the same parity as 
d{ai,a2); 

• the set of geodesic points between the identity permutation id and some permutation cr G §p is in 
bijection with the set of non-crossing partitions smaller than tt, where the partition n encodes the 
cycle structure of a. Moreover, the preceding bijection preserves the lattice structure. 


Finally, we introduce a definition which generalizes the trace function: for some matrices Ai, A 2 ,..., Ap G 
Ain(C) and some permutation a G §p, we define 

Tr^(Ai,... ,Ap) = Tr (AjjAjj • • • AjJ . 

cGCycles((T) 
c=(ii *2 ■■■ ik) 

We also put Tro-(A) = Tro-(A, A,..., A). 
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3.3. Graphical interpretation of Wick and Weingarten calculus. Our main motivation for the 
graphical calculus from to allow to interpret nicely the above integration theorems 3.3, 3.5. We consider 
first the case of the Weingarten calculus. The key to an interpretation relies on the concept of removal 
of boxes U and U. 

A removal r is a way to pair decorations of the U and U boxes appearing in a diagram. It consists in 
a pairing a of the white decorations of U boxes with the white decorations of U boxes, together with a 
pairing (5 between the black decorations of U boxes and the black decorations of U boxes. Assuming that 
V contains p boxes of type U and that the boxes U (resp. U) are labeled from 1 to p, then r = (a, (3) 
where a,/3 are permutations of Sp. The set of all removals of U and U boxes is denoted by Rem[/(2?). 

A removal r G Remf/(2?), yields a new diagram T>j. associated to r, which has the important property 
that it no longer contains boxes of type U oi U. One starts by erasing the boxes U and U but keeps the 
decorations attached to them. Assuming that one has labeled the erased boxes U and U with integers 
from {1,... ,p}, one connects all the (inner parts of the) white decorations of the i-th erased U box with 
the corresponding (inner parts of the) white decorations of the a(i)-th erased U box. In a similar manner, 
one uses the permutation f3 to connect black decorations. In [CNlOb], we proved the following result: 

Theorem 3.8. The following holds true: 

Eu{V)= Y. VrWg{n,ar^). 

r=(o,/3)sRem[7(X>) 

In the case where diagrams also involve a box G corresponding to a Gaussian random matrix, we are 
also able to compute the expected value conditional to the u-algebra of G by graphical methods, yielding 
a new interpretation of Wick formula 

Namely, the expectation value of a random diagram T can be computed by a removal procedure as 
in the unitary case. Without loss of generality, we assume that we do not have in our diagram adjoints 
of Gaussian matrices, but instead their complex conjugate box. This assumption allows for a more 
straightforward use of the Wick Lemma 3.3. As in the unitary case, we can assume that T) contains 
only one type of random Gaussian box G; the other independent random Gaussian matrices are assumed 
constant at this stage as they shall be removed in the same manner afterwards. 

A removal of the diagram 2? is a pairing between Gaussian boxes G and their conjugates G. The set 
of removals is denoted by RemG'(P) and it may be empty: if the number of G boxes is different from the 
number of G boxes, then RemG(P) = 0 (this is consistent with the first case of the Wick formula (3.2)). 
Otherwise, a removal r can identified with a permutation a G Sp, where p is the number of G and G boxes. 
The main difference between the notion of a removal in the Gaussian and the Haar unitary cases is as 
follows: in the Haar unitary (Weingarten) case, a removal was associated with a pair of permutations: one 
has to pair white decorations of U and U boxes and, independently, black decorations of conjugate boxes. 
On the other hand, in the Gaussian/Wick case, one pairs conjugate boxes: white and black decorations 
are paired in an identical manner, hence only one permutation is needed to encode the removal. 

To each removal r associated to a permutation a € Sp corresponds a removed diagram T>r constructed 
as follows. One starts by erasing the boxes G and G, but keeps the decorations attached to these boxes. 
Then, the decorations (white and black) of the i-th G box are paired with the decorations of the a{i)-th. 
G box in a coherent manner, see Figure 3. 

The graphical reformulation of the Wick Lemma 3.3 becomes the following theorem, which we state 
without proof. 

Theorem 3.9. The following holds true: 

^Gp] = Y 

rgRem(3(I5) 

3.4. Some elements of free probability theory. A non-commutative probability space is an algebra .A 
with unit endowed with a tracial state (f. An element of A is called a (non-commutative) random variable. 
In this paper we shall be mostly concerned with the non-commutative probability space of random 
matrices (A4n(L°°~(H,P)),E[n“^ Tr(-)]) (we use the standard notation L°°~p,F) = np>iL^(H, P)). 
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Figure 3. Pairing of boxes in the Gaussian case 


Let q. 1 ,... ^Ak be subalgebras of A having the same unit as A. They are said to be free if for all 
tti G Aj^ {i = 1,... A) such that (f){ai) = 0, one has 

(t){ai • • • afc) = 0 


as soon as ji A 32 , ji A 35, ■ ■ ■ ,3k-i A 3k- Collections Si, S 2 , ... of random variables are said to be free 
if the unital subalgebras they generate are free. 

Let (ai,...,aA;) be a A:-tuple of selfadjoint random variables and let C{Xi,...,Xk) be the free *- 
algebra of non commutative polynomials on C generated by the k indeterminates Xi, ..., Xk- The joint 
distribution of the family {ad^=i is the linear form 

/^(ai,...,afe) : ; ^k) —t C 

P i-A (j){P{ai,.. .,ak)). 

In the case of a single, self-adjoint random variable x, if the moments of x coincide with those of a 
compactly supported probability measure /r, i.e. 

Vp>i, 


we say that x has distribution /r. The most important distribution in free probability theory is the 
semicircular distribution 

A 4 — x‘^ 

k-SC{0,l) = - ^ - A- 2 , 2 ]ix)dx, 

which is, for reasons we will not get into, the free world equivalent of the Gaussian distribution in classical 
probability (see [NS06, Lecture 8] for the details). A random variable x having distribution fJ.sc{o,i) has 
the Catalan number for moments: 


AxP) 


Catp := 


if p is even 
if p is odd. 


More generally, if x has distribution psc{ 0 , 1 ), say that y = ax + m has distribution 


i/4cj2 - (x - m)2 

k'SC{m,P) 27rC7^ X[rn= 2 cr,m+ 2 a]\X)dX. 


(3.8) 


Given a /c-tuple (oi,..., Ok) of free random variables such that the distribution of a* is pai, the joint 
distribution is uniquely determined by the Pafs. A family {af ,..., a'^)n of fc-tuples of random 

variables is said to converge in distribution towards (ai,..., a^) iff for all P G C(Ai,..., Xk), A*(ay,...,a^)(-P) 
converges towards P(ax,...,ak){P) as n —>• 00 . Sequences of random variables {aA)n, ■ ■ ■ ,{oA)n are called 
asymptotically free as n —)■ 00 iff the /c-tuple {af,..., aA)n converges in distribution towards a family of 
free random variables. 

The following result was contained in [Voi98] (see also [CS06]). 
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Figure 4. The density of the semicircular distributions Hsc{o,i) (left) and fJ,sc{i,i/ 4 ) (right). 

Theorem 3.10. Let be a collection of independent Haar distributed random matrices o/A4n(C) 

and a set of constant matrices of M.n{C) admitting a joint limit distribution as n —)> oo with 

respect to the state n~^ Tr. Then, almost surely, the family {uj^\ admits a limit *-distribution 

{uk, 'Wk}k&N with respect to n~^ Tr, such that ui, U 2 , ■ ■ ■, {wi,W 2 , ■ ■ •} are free. 

Given two free random variables a,b G A, the distribution Pa+b is uniquely determined by pa and p^. 
The free additive convolution of pa and pb is defined by pa^ Pb = Pa+b- When x = x* G A, we identify 
Px with the spectral measure of x with respect to r. The operation ffl induces a binary operation on the 
set of probability measures on M. 

4. Entanglement of random quantum states 

4.1. Probability distributions on the set of quantum states. 

4.1.1. Random pure quantum states. The first model for random quantum states we look at is the uniform 
measure on pure quantum states. Indeed, the set of pure quantum states of a finite dimensional Hilbert 
space = C'^ can be identified, up to a phase, with the set of points on the unit sphere of TL, {x G : 
||x|| = 1}. On this set, there is a canonical probability measure, the uniform (or Lebesgue) measure. 

Definition 4.1. A random pure quantum state x G is said to follow the uniform distribution if x is 
uniformly distributed on the unit sphere ofC^. We denote the uniform distribution of pure states in 
by Xd- 

The uniform distribution has the following important properties [Nec07, Section 2.1]. 

Proposition 4.2. Let x G be a uniformly distributed pure quantum state, x ~ Xd- Then: 

(1) For any unitary operator U GlJd (U can either he fixed or random, but independent from x), the 
random pure state Ux also has the uniform distribution, Ux ~ Xd- 

(2) IfGGC^ is random complex Gaussian vector, X ~ Afc{0,ln), then X/||X|| is a uniform quantum 
pure state, X/||X|| ~ Xd- 

(3) Let U be a random unitary matrix distributed along the Haar measure on Un and let y he the first 
column of U. Then y G is a uniform quantum pure state, y ^ Xd- 

In applications, whenever one needs to consider generic pure quantum states and that there is no un¬ 
derlying structure in the Hilbert space where the states live, the uniform measure is used indiscriminately. 
Later, in Section 4.1.4, we shall encounter another probability distribution on a Hilbert space H, which 
is to be used in the case where the space has a tensor product structure H = Hi 0 • • • Hk. 

A different possibility was considered in [NP13], starting from the first point in 4.2, and replacing the 
Haar unitary U with the value of the unitary Brownian motion at some fixed time t (recall that the Haar 
measure is recovered at the limit t —)• oo). The resulting measure depends on the time t > 0 and on the 
initial vector x on which the unitary acts. We refer the interested reader to [NP13] for the details. 
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4.1.2. The induced ensemble. We introduce in this section a family of probability distributions on the set 
of (mixed) quantum states Dd which has a nice physical interpretation and, at the same time, a simple 
mathematical presentation. 

The following family was introduced by Braunstein in [Bra96] and studied by Hall [Hal98], and later, 
in great detail, by Zyczkowski and Sommers [ZSOl, SZ04]. 

Definition 4.3. Given two positive integers d, s, consider a random pure quantum state x G C'^ (8> C^. 
The distribution of the random variable M 

p = [idd 0 Trs](xx*) G Vd 

is called the induced measure of parameters {d,s) and it is denoted by i'd,s- 

We gather in the following proposition some basic facts about the measures (for the proofs, see [ZSOl]). 

Proposition 4.4. Let Vd 9 p ~ i'd,s be a density matrix having an induced distribution of parameters 
{d, s). 

(1) With probability one, p has rank min((i, s). 

(2) For any unitary operator U Gldd (fixed or independent from p), the density matrix UpU* has the 
same distribution as p. 

(3) There exist a unitary matrix U £ Ud and a diagonal matrix A = diag(Ai,..., A^) such that U 
is Haar distributed, U and A are independent, and p = UAU*; we say that the radial and the 
angular part of p are independent. 

(4) The eigenvalues (Ai,..., Xd) have the following joint distribution: 

d d 

Cd,siAi+...+Ad=i n ^^>^0 n 

i=i ^=1 

where Cd,s is the constant 

^ Tjds) 

ntons-^nd+l-iy 

Remark 4.5. Importantly, in the case s = d, the distribution Vd^d is precisely the Lebesgue measure 
on the compact set Vd, seen as a subset of the affine subspace {A G A4™(C) : Tr(A) = 0}, see [ZSOl, 
Section 2.4]. The measure Vd,d is sometimes called the Hilbert-Schmidt measure, since it is induced by the 
Euclidian, or Hilbert-Schmidt, distance. Note that the volume ofVd is given by [Z+03, Equation (4.5)] 

vol(®,) = 

In [Nec07], the induced measures Vd^s are shown to be closely related to the Wishart ensemble yVd,s 
from Definition 3.1. 

Proposition 4.6. Let W G AidiC) be a Wishart matrix of parameters {d,s) and put p := W/Ty{W) G 
Vd. Then 

(1) The random variables p and Tr(lT) are independent. 

(2) The distribution o/Tr(lT) is chi-squared, with ds degrees of freedom. 

(3) The random density matrix p follows the induced measure of parameters {d,s), i.e. p ~ Vd,s- 

(4) The random variable W, conditioned on the (zero probability) event Tr(lT) = 1, has distribution 

^d,s ' 

Let us now discuss the asymptotic behavior of the probability measures iyd,s- We first consider the 
“trivial” regime, where d is fixed and s —)• oo. The result here is as follows, see [Nec07]. 

Proposition 4.7. For a fixed dimension d, consider a sequence of random density matrices {ps)s having 
distribution ps ~ nd,s- Then, almost surely as s ^ oo, ps ^ d~^Id. 
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The interesting scaling is the fixed ration one, where both d and s = Sd grow to inhnity, in such a way 
that Sd/d —)> c, for a fixed constant c G (0,oo), The next result is an easy consequence of Theorem 3.2 
and Proposition 4.6. 

Proposition 4.8. For a fixed positive constant c, consider a sequence of random density matrices {pd)d 
having distribution pd ~ ^< 1 , 3 ^; here we assume that Sd ^ cd as d —>■ oo. Then, almost surely as d —)• oo, 
the empirical eigenvalue distribution of the random matrix SdPd converges weakly to the Marcenko-Pastur 
distribution tTc from (3.1) 

1 

it” rfE= 

1=1 

Informally, the result above can be stated as follows: consider a tensor product Hilbert space FL = 

CpciJ aj^d random, uniform pure state € FL. Then, the eigenvalues of the partial trace p = [id®Tr](fj'ijj*) 
are, up to a scaling of cd, distributed along the Marcenko-Pastur distribution tTc (3.1). 

Finally, as suggested by Proposition 4.6, in order to simulate on a computer quantum states having 
distribution i'd,s, one sets 

GG* 

Tt{GG*)' 

where G G A4rfxs(C) is an element from the Ginibre ensemble, i.e. G has i.i.d. standard complex Gaussian 
entries; see [ZPNCll, Section III.D]. 


4.1.3. The Bures measure. The Bures metric on the set of density matrices (see [BZ06]) is defined as 

dB{p,cr) = 

From this metric, one can define a probability distribution vb on Tdi by asking that Bures balls of equal 
radius have the same volume. 

The properties of the measure vb have been extensively studied in [Hal98, SZ03, OSZIO], we recall in 
the next proposition the main facts. 

Proposition 4.9. Let p G T>d be a random density matrix having distribution vb- Then 

(1) The eigenvalues Xi,... ,Xd of p have distribution 

GBlAi+-+Ad=l 11 lA,>oAi 11 ■ . ,T.~’ 

i=l ^ ^ 

where the eonstant Cb reads 

g, r(dV2) 

(2) If A G A4rf(C) is a random Ginibre matrix and U G Ud is a Haar random unitary independent 
from A, then the random matrix 

{I + U)AA*{I + U)* 

^ “ Tr[{I + U) A A* {I + U)*] 

has distribution vb- 

4.1.4. Random states associated to graphs. The probability distributions on Vd we have considered so far 
do not make any assumptions on the internal structure of the underlying Hilbert space C'^. To address 
this issue, in [CNZIO, CNZ13] the authors introduce and study a new family of ensembles of density 
matrices, called random graph states, which encode the underlying structure of the Hilbert space. We 
introduce next these distributions, referring the interested reader to [CNZIO, CNZ13] for the details. 
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Consider a graph G = {V, E) having k vertices Vi,... ,Vk and m edges Ei,, Em- Let iV be a fixed 
positive integer, and consider the (total) Hilbert space 

k 

i=l 

where Tii = is the local Hilbert space at vertex i and di is the degree of Vi in G. Each copy of 

inside Hi is associated to some edge Ej incident to V), in such a way that the total Hilbert space 
admits two decompositions, relative to vertices and edges: 

k m 

i=i j=i 

where JCj = (8> G^ . Define now the following random pure state 

k 

) 

where {Ui}^^-^ are i.i.d. Haar distributed random unitary matrices acting on the local Hilbert spaces at 
the vertices, and ojj are maximally entangled states (2.2). Note that in the above expression, the unitary 
operators “mix” the product of maximally entangled states at the vertices, yielding, in general, a global 
entangled state. 

Let us now define mixed quantum states with the above formalism. For a subset S C {1, 2,..., 2m} of 
copies of C'^, define 

PG,S = [ids (8) Trsc] {^gEg) ^ '^n\s\ ■ 

The statistical properties of the distribution of pG,s are studied in [CNZIO, Section 5]. 

Here, we show that the area law holds exactly for graph states, provided that the marginal under 
consideration satisfies a particular condition, called adaptability. 

To any graph state we associate two partitions of the set of n = 2m subspaces: a vertex partition 
T’vertex which encodes the vertices of the graph, and a pair partition Hedge which encodes the edges 
(corresponding to maximally entangled states). More precisely, two subsystems Hi and Hj belong to the 
same block of 'Pvertex if they are attached to the same vertex of the initial graph. Each edge (i,j) of 
the graph contributes a block of size two {i,j} to the edge partition Hedge- Recall that a marginal of a 
random graph state ^g^*g is specified by a 2-set partition "Ptrace = {*55 Rj- 

Let us introduce now a fundamental property of the (random) quantum states associated to graphs. 

Definition 4.10. A marginal ps is called adapted if 

H trace — Hyertex (^R) 

for the usual refinement order on partitions- In other words, a marginal is adapted if and only if the 
number of traced out systems in each vertex is either zero or maximal- If this is the case, then the 
partition boundary, which splits the graph into parts {5, Tj, does not cross any vertices of the graph. 

Because of the above property, for adapted marginals, we can speak about traced out vertices, because 
if one subsystem of a vertex is traced out, then all the other systems of that vertex are also traced out. We 
now dehne precisely what we mean by area laws in the context of quantum states associated to graphs. 
The partition {5, T} dehnes a boundary between the set of vertices that are traced out and vertices that 
survive. 

Definition 4.11. The boundary of the adapted partition {S', T} is defined as the set of all (unoriented) 
edges e = {isGr} in the graph state with the property that is & S and jx G T. Equivalently, it is the set 

of edges of the type *-•. The boundary of a partition shall be denoted by dS. 

The area of this boundary is its cardinality |9S'|, i.e. the number of edges between S and T. 
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It was shown in [CNZ13] that the area law holds exactly for adapted marginals of graph states, where 
we allow arbitrary dimensions of subsystem. Note that, for a given (boundary) edge we have 

di = dj, the common dimension of the maximally entangled state corresponding to the edge The 

following result follows from linear algebra considerations, and one does not need random Haar nnitary 
operators in this case. 

Proposition 4.12. Let ps be an adapted marginal of a graph state ipc- Then, the entropy of ps has the 
following exact, deterministic value: 

H{ps) = \dS\\ogN. (4.2) 


P2 


V4 



Figure 5. An adapted marginal for a graph state. The dashed (green) line represents 
the boundary between the traced-out subsystems T and the surviving snbsystems S. 


For the system corresponding to the graph shown in Figure 5 with all subsystems of size N the von 
Neumann entropy reads 

H{ps) = blog N. (4.3) 

This follows from the fact that ps is in this case a unitary conjugation of a maximally mixed state of size 
with an arbitrary pure state of size N^. 

We refer the reader to Section 8.4 for a more general resnlt in this direction (for non-adapted marginals). 

4.2. Moments. Average entropy. In this section we present resnlts concerning certain quantities of 
interest in qnantum information theory, and in particular their average values over the different ensembles 
introduced previously. 

Let us start with the case of the uniform measure on the set of pure quantum states. The statistics 
of the coordinates of a uniform random pnre state can be obtained by the so-called spherical integrals 
[Foll3, Section 2.7]. The following resnlt could also be deduced from the Wick formula in Section 3.1 or 
from the Weingarten formula in Section 3.2. 


Lemma 4.13. For any non-negative integers ai ,..., > 0, we have 


={d- 1)! 


ai!a2! • • • ctd}. 


{d — 1 ai ci2 + • • • + cid)! 


We move now to the case of random density matrices having the induced distributions discussed 
in Section 4.1.2. Using the relation between this distribution and the Wishart ensemble, the following 
result has been shown in [SZ04, Nec07]. 


Proposition 4.14. The moments of a random density matrix p G Vd having distribution Vd^s o,re given 
by 

rjds) ..i-i [s + q- j]q[d + q- j]q 

r{ds + q)^^^ ^ {q-j)l{j-l)[ ’ 


ET7(p^) 
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where [a]g 


a{a 


1)•••(a 


q + l)In particular, the first few moments read 


ETY(p2) 

ETV(p3) 

ETY(/) 


d + s 
ds “t" 1 

df + 2>ds + + 1 

[ds + l)(ds + 2) 

d^ + Qd^s + Qds"^ + + 5d + 5s 

{ds + l){ds + 2){ds + 3) 


The average entropy of a random density matrix was conjectured by Page in [Pag93] and later proved 
in [FK94, SR95, Sen96]. 


Proposition 4.15. The average von Neumann entropy of a random density matrix having distribution 
Vd,s is 

, ^1 d-l 

= E i 

i=s+l 

4.3. Entanglement. The notion of quantum entanglement has been recognized to be at the center of 
quantum mechanics from the early days of the theory. The reader interested in entanglement theory is 
referred to the excellent review paper [HHHH09]. In this work, we will only deal with bipartite entangle¬ 
ment, which is defined as follows. First, we say that a quantum state p G Tnk is separable iff it can be 
written as a convex combination of tensor product states: 

r 

P = '^pm ® Ti, 

i=l 

where ai G Vn, u G Vk and (pi) is a probability vector: pi > 0 and J2iPi — 1- The set of separable states 
is denoted by S£Vn,k T Vnk and the states in its complement are called entangled. 

In this section, we are going to review some results about the (Euclidean) volume of the set of separable 
states. Equivalently, volumes can be expressed, up to a factor, from the probability that a quantum state 
is separable, under the induced measure r'nk,nk, see Remark 4.5. 

The first result in this direction is quite remarkable [GB02]. It has many interesting corollaries, one of 
them being that the set SSV of separable states has non-empty interior. 


Proposition 4.16. The largest Euclidean ball centered at the maximally mixed state I / {nk) and contained 
in Vnk is separable and has radius [nk{nk — 1)]“^/^. 

In the case of the Euclidean measure Vnk,nk is has been shown in [AS06, Theorem 1] that the ratio 
between the volume of S£Vn,n and Vn^ vanishes when n —>■ oo. In the case where the parameter s of 
the induced measure Vnk,s grows to infinity, while n and k are kept hxed, the measure Unk,s concentrates 
around the maximally mixed state Ink (see Proposition 4.6), so 

dm e S£Vnk] = 1. 

More precise estimates have been obtained in [ASY14] in the case of the induced measures. In order 
to present these results, we need hrst to introduce the concept of thresholds. 

Consider a family of sets of density matrices C V^. The idea of a threshold captures the behavior 
of the probability that a quantum state p G V^ is an element of X^, when the probability is measured 
with the induced measure i'd,sj w® would like to know, when d —>• oo, for which values of the parameter 
s, the probability vanishes or becomes close to 1. More precisely, we say that a threshold phenomenon 
with value cq on the scale / occurs when the following holds: let Sd ~ cf{d) for a constant c > 0; Then 

(1) If c < Co, limrf^oo [p G Xd] = 0. 

(2) If C > Co, limrf^oo [P ^ = 1- 

This definition was first considered in the Quantum Information Theory literature by Aubrun in [Aubl2] 
to study the PPT criterion (see next section). 
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We state now the main resnlt in [ASY14], regarding the threshold for the sets S£Vn,n- The following 
statement corresponds to [ASY14, Theorem 2.3], which deals with the so-called balanced regime k = n. 
For the unbalanced regime k ^ n, see [ASY14, Section 7.2], 

Theorem 4.17. There exist constants c,C and a function f{n) satisfying 

cn^ < f{n) < C'n^log^(n) 

such that 

(1) If Sn < f{n), lim^^oo 2 [P^ S£Vn,n\ = 0. 

(2) If Sn > fin), lim^^ooT!. 2 [P ^ S£Vn,n\ = 1- 

Note that the above result does not enter precisely in the threshold framework, as it was defined just 
above; one would need to eliminate the logarithm factors and to compute exactly the constants in the 
statement above to achieve this, see Question 9.1. The result is nevertheless an important achievement, 
given the fact that questions dealing directly with the set of separable states are usually very difficult. 

4.4. Entanglement criteria. The question whether a given mixed quantum state is separable or en¬ 
tangled has been proven to be an NP-hard one [Gur03]. To circumvent this worse-case intractability, 
entanglement criteria are used. These are efficiently computable conditions which are necessary for 
separability; in other words, an entanglement criterion is a (usually convex) super-set of the set of 
separable states, for which the membership problem is efficiently solvable. As in the previous section, 
from a probabilistic point of view, estimating the probability that a random quantum state (sampled 
from the induced ensemble) is an element of is central. In what follows we shall tackle this problem 
for different entanglement criteria in the framework of thresholds. 

Let us start with the most used example, the positive partial transpose criterion (PPT). The PPT 
criterion has been introduced by Peres in [Per96]: if a quantum state p G Vnk is separable, then 

:= [id® transpK/o) > 0. 

Note that the positivity of p^ is equivalent to the positivity of p"^ = [transp® id](/9), so it does not matter 
on which tensor factor the transpose application acts. We denote by VVTn,k the set of PPT states 

'P'P'Tn,k ■= {p £ Unk '■ p'" > 0} Q S£Vn,k- 

This necessary condition for separability has been shown to be also sufficient for qubit-qubit and qubit- 
qutrit systems (nk < 6) in [HHH96]. The PPT criterion for random quantum states has first been 
studied numerically in [ZPBC07]. The analytic results in the following proposition are from [Aubl2] (in 
the balanced case) and from [BN13] (in the unbalanced case); see also [FS13] for some improvements in 
the balanced case and the relation to meanders. 


Proposition 4.18. Consider a sequence pn G IInk„ of random quantum states from the induced ensemble 
nnk„,cnk„, whcrc kn is a function of n and c is a positive constant. 

In the balanced regime kn = n, the (properly rescaled) empirical eigenvalue distribution of the states pn 
converges to a semicircular measure Psc(i,i/c) of mean 1 and variance 1/c, see (3.8). In particular, the 
threshold for the sets VVTn,n (n —)■ ooj is cq = 4. 

In the unbalanced regime kn = k fixed, the (properly rescaled) empirical eigenvalue distribution of the 
states Pn converges to a free difference of free Poisson distributions (see Section 3-4 for the definitions) 


^cfc(fc+l)/2 S '^ck(k—l)/2- 

In particular, the threshold for the sets VVTn,k (k fixed, n ^ oo) is 


Co — 2 -|- 2 



We consider next the reduction criterion (RED). Introduced in [HH99, CAG99], the reduction criterion 
states that if a bipartite quantum state p G Vnk is separable, then 

:= [id®R](p)>0, 
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where R : A1fc(C) —)> M.k{C) is the reduction map, 

R{X) = 4 • TV(X) - X. 

We denote by TZSVn^k the set of quantum states having positive-semidefinite reductions (on the second 
subsystem) 

R'£'Rn,k '■= {p £ Rnk ’■ > 0} 3 S£Vn,k- 

Several remarks are in order at this point. First, it is worth mentioning that in the literature, the reduction 
criterion is sometimes defined to ask that both reductions, on the first and on the second subsystems, are 
positive-semidefinite; since going from one reduction to the other one can be done by simply swapping 
the roles of C” and C^, we focus in this work on the reduction on the second subsystem. We gather in 
the next lemma some basic properties of the set TZ£T>n,k, see, e.g. [JLN14] for the proof. 

Lemma 4.19. The reduction criterion is, in general, weaker than the PPT criterion: 

S£Vn,k ^ 'P'PTn,k ^ T^£'£^n,k ^ d^nk- 

2 (i.e. when the system on which the reduction map acts is a qubit), the two criteria are 

d^£Vn,k = 'P'PTn,k- 

Although the reduction criterion is weaker than the PPT criterion for the purpose of detecting entan¬ 
glement, its interest stems from the connection with the distillability of quantum states, see [HHH98]. 

We gather in the next proposition the values of the thresholds for the sets TZ£'Dn^k- Since, in the case 
of the reduction criterion, the tensor factor on which the reduction map R acts does matter, we need to 
consider two unbalanced regimes: one where n is fixed and A: —)• oo, and a second one where n —)• oo and 
k is kept fixed. The results below have been obtained in [JLN14] (for the second unbalanced regime) and 
in [JLN15] (for the balanced regime and the first unbalanced regime). 

Proposition 4.20. The thresholds for the sets TZ£T>n,k o,re as follows: 

(1) In the balanced regime, where both n,k ^ oo, the threshold value for the parameter s of the 
induced measure Vnk,s is on the scale s ^ cn at the value cq = 1. 

(2) In the first unbalanced regime, where n is fixed and k —)• oo, the threshold value for the parameter 
s of the induced measure fnk^s is on the scale s ^ c at the value cq = n. 

(3) In the second unbalanced regime, where k is fixed and n —)■ oo, the threshold value for the parameter 
s of the induced measure fnk^s is on the scale s ~ cnk at the value 

_ (1 + Vfc + l)^ 
k{k — 1) 

Let us mention now that both thresholds for the PPT and the RED criterion, in the unbalanced case, 
have been treated, in a unihed manner, in the recent preprint [ANV15]. A general framework is developed 
in [ANV15] in which many examples of entanglement criteria fit. 

Criteria of the type [id (8) /] (p) > 0 have been studied from a random matrix theory perspective in 
[CHN15] in the case of random linear maps /. In [CHN15], the authors introduce a family of entanglement 
criteria index by probability measures. The main idea is to consider maps / between matrix algebras 
obtained from random Choi matrices. More precisely, consider a compactly supported probability measure 
p, and let G Xind{C) a sequence of unitarily invariant random matrices converging in distribution, as 
d —)■ oo, to p (n being kept fixed). Let fd : A4n(C) —)■ A4rf(C) be a (random) linear map such that the 
Choi matrix (2.4) of fd is Xd- Then, the positivity of the map fd, asymptotically as d —)• oo, depends only 
on p and its free additive convolution powers [CHN15, Theorem 4.2] (see Section 3.4 for the definition of 
convolutions in free probability). 

Theorem 4.21. The sequence of random linear maps {fd)d has the following properties: 

(1) //supp(p®"^/^) C (0,oo), then, almost surely as d ^ oo, fd is k-positive. 

( 2 ) //supp(p®"'/*^) n (—oo, 0) / 0, then, almost surely as d ^ oo, fd is not k-positive. 


However, at k = 
equivalent 
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From the above result, if follows that probability measures /i with the property that the maps they 
yield are positive, but not completely positive, give interesting entanglement criteria. It was shown in 
[CHN15, Theorem 5.4] that such maps can be obtained from shifted semicircular measures (3.8), and 
that they can detect PPT entanglement. The global usefulness of such entanglement criteria is left open 
(see Question 9.3). 

We discuss next the realignment criterion (RLN), introduced in [Rud03, CW02], is of different nature 
than the two other criteria we already discussed. For any matrix X G A4niC) ® Alfc(C), dehne 

= L{X) G 

where L is the realignment map, defined on elementary tensors by 

L{eie* ® faft) = eifl ® 

The realignment criterion states that a separable quantum state p G 'Dnk satisfies 

llp"'"lli < 1, 

where || • ||i is the Schatten 1-norm (or the nuclear norm). As usual, we denote by lZCMn,k the set of 
quantum states satisfying the realignment criterion 

TZCMn,k ■■= {p e Vnk : II/'"111 < 1} 2 SSVnM- 

The realignment criterion is not comparable to the PPT criterion, hence there are PPT entangled 
states detected by the RLN criterion. Since the inclusion partial relation cannot be used to compare the 
two sets/criteria, the notion of threshold is particularly interesting in this situation. The result below is 
from [AN 12]. 

Proposition 4.22. The thresholds for the sets TZCMn,k oi'e as follows: 

(1) In the balaneed regime, where n = k ^ oo, the threshold value for the parameter s of the induced 
measure is on the scale s ~ c/ at the value cq = (8/37r)^ ~ 0.72. 

(2) In the unbalaneed regime, where n —)• oo and k is fixed, the threshold value for the parameter s of 
the indueed measure Vnk,s is on the seale s ^ c at the value cq = k‘^. 

In partieular, comparing the values above with the ones in Proposition 4-18, one can conclude that, from 
a volume perspective, the realignment criterion is weaker than the PPT criterion (i.e. the thresholds for 
RLN are smaller than the thresholds for PPT). 

We gather in Table 1 the values of the thresholds for the different entanglement criteria discussed in 
this section, as well as for the set of separable states itself. The striking feature of these values is the 
fact that the (bounds for the) thresholds for the set SSV, obtained in the important work [ASY14], are 
one order of magnitude above the thresholds for the various entanglement criteria. This means that, 
from a volume perspective, the set S8V is much smaller than the set of states satisfying the different 
entanglement criteria. 

Finally, in [Lanl5], Lancien studies the performance of r-extendibility criteria for random quantum 
states. Recall that a bipartite quantum state pab £ T^nk is said to be r-extendible if there exists a 
(r -|- l)-partite state uab^ £ T^nk"^ which is invariant under all permutations of the R-systems and has 
pab as a marginal: 

[idnk ® Trfcr-i](fTAB--) = PAB- 

Obviously, any separable state pab is r-extendible, for all r > 1. Doherty, Parrilo, and Spedalieri have 
shown in [DPS04] that these conditions are also sufficient. 

Theorem 4.23. A bipartite quantum state pab £ T^nk is separable if and only if it is r-extendible with 
respect to the system B for all r gH. 

In [Lanl5], besides computing estimates on the average width of the set of r-extendible states, Lancien 
computes a lower bound for the threshold value of these sets, for fixed r. 
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Balanced regime 

re, A; —)• oo 

Unbalanced regime 

re — >■ oo, k fixed 

S£V 

^ s < log^ re [n = k] 

^ s < nk'^ \o^{nk.) 

WT 

s ~ cnk 

s ~ cnk 

II 

II 

o 

Co = 2 -|- 2y^ 1 — ^ 

n£v 

s ^ cn 

s ~ cnk 

Co = 1 

„ _ (l+v/fc+T)2 
^0 — k{k-l) 

ncN 

s ~ cnk 

s fixed 

Co = (S/Stt)^ [re = k] 

Co = A;2 


Table 1. Thresholds for the sets of separable states SSV, PPT states WT, states sat¬ 
isfying the reduction criterion TZ£T), and states satisfying the realignment criterion TZCN. 


Proposition 4.24. [Lanl5, Theorem 6.4] Fix r > 1 and consider balanced random quantum states 
Pn G having distribution ^^r any e > 0, if the function Sn is asymptotically smaller than 

(1 — e)(r — 1)^/(4r)n^ as n ^ oo, then, 

lim F[pn is r-extendible] = 0. 

72^00 

In other words, the threshold cq for the set of r-extendible states in the scaling s ~ cn^ is larger than 
(r — l)^/(4r). 

Notice that the analysis in [Lanl5] does not give any upper-bounds on the threshold cq, see Question 

9.2. 

4.5. Absolute separability. Whether a quantum state p is separable or entangled does not only depend 
on the spectrum of p: there are, for example, rank one (pure) states which are separable (p = ee* (8> //*) 
and other states which are entangled {p = n, see (2.2)). In other words, the separability/entanglement 
of a quantum state depends also on its eigenvectors. In order to eliminate this dependence, in [KZOl] the 
authors introduced the set of absolutely separable states 

AS£Vn,k = n ^ -U* = {p : UpU* is separable VI/ E Unk} C V^k- 

U &Unk 

Obviously, the truth value of p E AS£'Pn,k depends only on the spectrum A of the density operator p, so 
one could simply use 

3 AS£Vn,k = {-^ : diag(A) E AS£Vn,k]■ 

Similarly, one can define absolute versions (and the corresponding spectral variants) for the sets WT, 
TZ£V, and TZCN. 

An explicit description of the set AWT has been obtained in [Hil07], as a finite set of positive- 
semidefinite conditions. The analogue question for AIZ£T> has been settled in [JLNR15], whereas the 
problem of finding an explicit description of the set AIZCN remains open. Interestingly, it has been 
shown in [Johl3] that for qubit-qudit systems (min(n, fc) = 2), absolute separability is equivalent to the 
absolute PPT property. Later, in [AJR14] evidence towards the general conjecture AS£Vn,k = AWTn,k 
(for all n, k) has been collected; in particular, the authors show that for all n, k, AWTn,k ^ AIZCMn,k- 
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At the level of thresholds, the values (and even the scales) for AS£V and ATZCM are completely open. 

The following results, for AWT and ATZEV are from [CNY12], and respectively [JLN15]. 

Proposition 4.25. The thresholds for the sets AWTn,k are as follows: 

(1) In the balanced regime, where n > fe —?• oo, the threshold value for the parameter s of the induced 
measure Unk,s is on the scale s ~ cnlA at the value cq = 4. 

(2) In the unbalanced regime, where n —)■ oo and k is fixed, the threshold value for the parameter s of 
the induced measure Vnk,s is on the scale s ~ cnk at the value cq = {k + ■s/k'^ — 1)^. 

The thresholds for the sets ATSVn^k are as follows: 

(1) In the balanced regime, where n,k ^ oo, the threshold value for the parameter s of the induced 
measure i'nk,s is on the scale s ~ cnk at the value cq = 1. 

(2) In the first unbalanced regime, where k ^ oo and n is fixed, the threshold value for the parameter 
s of the induced measure Unk,s is on the scale s ^ ck at the value cq = n — 2. 

(3) In the second unbalanced regime, where n —?• oo and k is fixed, the threshold value for the parameter 
s of the induced measure i'nk,s is on the scale s ~ cnk at the value 

+ ^ + 



5. Deterministic input through random quantum channels 


Although a global understanding of the typical properties of a random channel is desirable (and this is 
the object of Section 6), obtaining results for interesting outputs of given random channels is of intrinsic 
interest. For example, as we explain subsequently in Section 7.2, the image of highly entangled states 
under the tensor product of random channels is an important question, as it is one of the keys to obtaining 
violations for the additivity of the minimum output entropy. 

Our first model is a one-channel model that consists in considering matrices Xn which have a macro¬ 
scopic scaling Tr(A^) ~ n • 4>{xP), where x is some non-commutative random variable. In order to obtain 
states, we normalize: 


A = 


A 

Tr A' 


Therefore, the moments of the output matrix Z = <h(A) are given by 


E[Tr(A^’)] = E[Tr(4>(A)P)] = E 


(TrA)P 


E[Tr(4>(A)P)] 
(Tr X)P 


We consider different asymptotic regimes for the integer parameters n and k. It turns out that the 
computations in the case of the regime k fixed, n —?• oo is more involved, and its understanding requires 
free probabilistic tools. To an integer k and a probability measure /i, we associate the measure 
defined by 

m = f 1 - 


Proposition 5.1. The almost sure behavior of the output matrix Z = 4’(A) is given by: 

(I) When n is fixed and k —)• oo, Z converges almost surely to the maximally mixed state 

P* — In • 
n 

(II) When k is fixed and n —)> oo, the empirical spectral distribution ofpknZ converges to the probability 
measure v = , where ffl denotes the free additive convolution operation, p is the probability 

distribution of x with respect to cp: (j){x'P) = f t^ dp{t) and p is the mean of p, p = (p{x). 

(Ill) When n. A: —>■ oo and k/n —)> c, the empirical spectral distribution of the matrix nZ converges to 
the Dirac mass 5i. 
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6. Random quantum channels and their output sets 
We do this section in the chronological order. 

6.1. Early results on random unitary channels. 

6.1.1. Levy’s lemma. Some results are already available in order to quantify the entanglement of generic 
spaces in Grp^(C"^ ( 8 > C^). The best result known so far is arguably the following theorem of Hayden, 
Leung and Winter in [HLW06]: 

Theorem 6.1 (Hayden, Leung, Winter, [HLW06], Theorem IV.1). Let A and B be quantum systems of 
dimension dA and ds with ds > dA > 3 Let 0 < a < logdA- Then there exists a subspace S C A<Si B of 
dimension 

such that all states x € S have entanglement satisfying 

H{X{x)) > logdA — CK — ft, 
where ft = dA/{dB log2) and T = 1/1753. 

For large d, Aubrun [Aub09] studies quantum channels on obtained by selecting randomly N 
independent Kraus operators according to a probability measure fa on the unitary group U{d). He shows 
the following result; 

Theorem 6.2. Consider a random unitary channel Mjv —)• Mj\f obtained with d iid Haar unitaries. For 
for N » d/e^, such a channel is s-randomizing with high probability, i.e. it maps every state within 
distance e/d of the maximally mixed state. 

This slightly improves on the above result by Hayden, Leung, Shor and Winter by optimizing their 
discretization argument. 

6.2. Results with a fixed output space. We introduce now a norm on which will have a very 
important role to play in the description of the set Kn,k,t in the asymptotic limit n —)• oo. 

Definition 6.3. For a positive integer k, embed as a self-adjoint real subalgebra TZ of a IR factor A 
endowed with trace (f, so that (p{{xi, ..., Xk)) = (xi + • • • + Xk)/k. Let pt be a projection of rank t G (0, 1] 
in A, free from TZ. On the real vector space we introduce the following norm, called the (t)-norm.' 

\\x\\(t) ■= WPtxptWoo^ ( 6 - 1 ) 

where the vector a: G zs identified with its image in TZ. 

We now introduce the convex body C as follows: 

Kk,t '■= {A G Afc I Va G A^, (A, a) < ||a||p)}, (6-2) 

where (•,•) denotes the canonical scalar product in We shall show later that this set is intimately 
related to the (t)-norm: Kky is the intersection of the dual ball of the (t)-norm with the probability 
simplex A^. Since it is defined by duality, Kk,t is the intersection of the probability simplex with the 
half-spaces 

H^{a,f) = {x G I (x,a) < ||a||(-^)} 

for all directions a G A^. Moreover, we shall show that every hyperplane H[a,t) = {x G | (x,a) = 
||a||(-j)} is a supporting hyperplane for Kk^t- 

Let P) be a probability space in which the sequence or random vector subspaces (V/)n>i is 

defined. Since we assume that the elements of this sequence are independent, we may assume that 
H = OnM Gr 7 v(C^( 8 )C”) and P = ®n>ihn where is the invariant measure on the Grassmann manifold 
Gv]\[{C^ (g) C”). Let Pn G M.nk{Tf) be the random orthogonal projection whose image is W- For two 
positive sequences {an)n and {bn)n, we write On 'C bn iff an/bn —)• 0 as n —)• oo. 
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Proposition 6.4. Let he a sequence of integers satisfying <C n. Almost surely, the following holds 
true: for any self-adjoint matrix A G A^fc(C), the Un-th largest eigenvalues of Pn{A® In)Pn converges to 
||a||(t) where a is the eigenvalue vector of A. This convergence is uniform on any compact set of A4k{C)sa- 

Proof. For any self-adjoint A G A^fc(C), the almost snre convergence follows from and from Theorem 
3.10. 

Let Ai be a countable family of self-adjoint matrices in Mk{C) and assume that their union is dense 
in the operator norm unit ball. By sigma-additivity, the property to be proved holds almost-surely 
simultaneously for all Afs. 

This implies that the property holds for all A almost-surely, as the j-th. largest eigenvalue of a random 
matrix is a Lipschitz function for the operator norm on the space of matrices. □ 

The set on which the conclusion of the above proposition holds true will be denoted by Tl' and we 
therefore have P(n') = 1. Technically, Ll' depends on Un but in the proofs, we won’t need to keep track 
of this dependence as Vn will be a fixed sequence. 

The main result of our paper is the following characterization of the asymptotic behavior of the random 
set Kn,k,t- We show that this set converges, in a very strong sense, to the convex body iLfc.t- 

Theorem 6.5. Almost surely, the following holds true: 

• Let O be an open set in containing Kk^t- Then, for n large enough, Kn^k,t C O. 

• Let 1C be a compact set in the interior of Kk^t- Then, for n large enough, 1C C Kn,k,t- 

6.3. More results about the output of random channels. More results are known about the output 
of random quantum channels. Instead of giving a full list, let us state the following result from [CFN15], 
that supersedes many results already known. 

Theorem 6.6. Let k be a fixed integer, and <1>„ : Mn(C) —>■ Mfc(C) he a sequence of quantum channels 
constructed with constant matrices and unitary matrices that are independent from each other. Then, 
there exists a compact convex set K such that its the random collection out output sets converges almost 
surely to K in the topology induced by the Hausdorff distance between compact sets. 

This theorem includes in particular encompasses the following two important examples. Firstly, the 
random unitary channels 

4>„(x) =^ 

but also, more importantly a product Xn = ® where H is any quantum channel fixed in advance, 

and is any of the sequences considered previonsly. 

Actually, there is even more, namely: in the previous theorem, the output set K can actually be 
exactly realized via the collection of outputs of pure states (no need for all input states). In addition, 
the boundary of the collection of output sets converges to the boundary of K in the Hausdorff distance 
(which means that any point in the interior of K is attained within finite time with probability one), 
and for any finite collection of I elements in the interior of K, it is possible to find with probability one 
in finite time an family of pre-images by pure states which are close to orthogonal to each other (the 
tolerance is arbitrary and can be fixed ahead of time). Somehow, this is the strongest convergence one 
can hope for, and it is actually rather counterintuitive that the image of the extreme points of a convex 
body (the input states) end up filling exactly the image of the convex body. 

As a corollary, however, we obtain the following: 

Corollary 6.7. In all examples of random channels taken so far, the Holevo capacity converges with 
probability one. In particular, if the image set K contains the identity, with probability one, 

logk. 

7. The additivity problem for tensor products of random quantum channels 

7.1. The classical capacity of quantum channels and the additivity question. The following 
theorem snmmarizes some of the most important breakthroughs in quantnm information theory in the 
last decade. It is based in particular on the papers [Has09, HW08]. 
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Theorem 7.1. For every p G [l,oo], there exist quantum channels and such that 

® ^r) < (7.1) 

Except for some particular cases {p > 4.73, [WH02] and p > 2, [GHPIO]), the proof of this theorem 
uses the random method, i.e. the channels 4* are random channels, and the above ineqnality occurs 
with non-zero probability. At this moment, we are not aware of any explicit, non-random choices for <h, 'h 
in the case 1 < p < 2. 

The additivity property for the minimum output entropy was related in [Sho04] to the addi¬ 

tivity of another important entropic quantity, the Holevo quantity 


x(^>) 


max 


Hi 



Y,p^Hlmx,)) 


The regularized Holevo quantity provides [Hol98, SW97] the classical capacity of a quantum channel 4), 
i.e. the maximum rate at which classical information can be reliably sent through the noisy channel 


7.2. Conjugate quantum channels and the MOE of their tensor product. In this subsection we 
gather some known resnlts abont the MOE of tensor products of conjugate channels 4* = 4> (8) These 
results will be used in the next subsection on counterexamples. Let us stress from the beginning that in 
there is much less known about the output eigenvalues of 4* than about those of a single random channel 
‘h. In particular, we do not have an explicit description of the output set of 4^, such as the one from 
Theorem 6.5. Actually, we have mostly upper bounds in this case, coming from the trivial inequality 

(7.2) 

where H is the maximally entangled state (2.2). 

The first result in this direction is a non-random one, giving a lower bound on the larges eigenvalue of 
the output of the maximally entangled state. To fix notation, let ‘h : Mrf(C) —>■ Mfc(C) a quantum channel 
coming from an isometry V : C'^ —>■ ® C"". In [HW08], the authors observed that in the context of two 

random channels given by two dilations Vi (resp. V 2 ), it is relevant to introduce the further symmetry 
V 2 = Vi, as it ensures that at least one eigenvalue is always big. 


Lemma 7.2. The larges eigenvalue of the output state Z 


[$(8)4'](Hd) satisfies the following inequality: 


Z 


fL 

nk 


This result appeared several times in the literature (and it is sometimes referred to as the “Hayden- 
Winter trick”), see [HW08, Lemma 2.1] or [CNlOb, Lemma 6.6] for a proof nsing the graphical (non- 
random) calculus from Section 2.4. 

In the context of random quantnm channels, one can improve on the result above, by computing the 
asymptotic spectrum of the output state Z^- This has been done in [CNlOb] in different asymptotic 
regimes. Since in this review we focns on the regime where k is fixed and d ~ tnk —)• 00 , we state next 
Theorem 6.3 from [CNlOb]. 


Theorem 7.3. Consider a sequence of random quantum channels coming from random isometries 4>„ : 

—)• Mfc(C) where dn is a sequence of integers satisfying dn ~ tnk as n —)• 00 for fixed parameters 
k £ N and t G (0,1). The eigenvalues of the output state 

A 42 (C) 3 Zn = [$n ® 

converge, almost surely as n —>■ 00 , to 

• t + with multiplicity 1; 

• with multiplicity k"^ — 1. 
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In order to prove such results, one uses the method of moments: using the Weingarten formula (3.3) 
from Section 3.2, it is shown in [CNlOb, Section 6.1] that, for all p > 1, 

pETr(ZP)= 

oi,/3eS2p 

where 7 , 6 are some fixed permutations in ^ 2 ^; we present in Figure 6 the diagram for the output matrix 
Zn- We then compute the dominating terms in the above sums, by finding the pairs (a, /?) corresponding to 
the terms having the largest n powers; this is done by replacing dn = tkn + o{n) and using the asymptotic 
expression for the Weingarten factor from Theorem 3.6. It turns out that the set of dominating pairs 
(a, 13) is small, and one can compute, up to o(l) terms, the sum, proving the result. Since the matrices Zn 
live in a space of fixed dimension (fe^), a simple variance computation allows to go from the convergence 
in moments to the almost sure convergence of the individual eigenvalues. 

Note that Theorem 7.3 improves on Lemma 7.2 in two ways: the norm of the output is larger, and we 
obtain information on the other eigenvalues too. This turns out to be useful in obtaining better numerical 
constants for the counterexamples to additivity, see the discussion in Section 7.3. 



Figure 6. Diagram for the output matrix Zn, obtained by putting a maximally entangled 
state through a product of two conjugate quantum channels. 


Finally, the last result we would like to discuss in relation to products of conjugate channels is [FN14, 
Theorem 5.2]. The setting here is more general: the authors consider not one copy of a channel and its 
conjugate, but 2 r channels (in what follows, r is an arbitrary fixed positive integer): 


Informally, [FN14, Theorem 5.2] states that, among a fairly large class of input states, the tensor products 
of Bell states (vr G Sr is an arbitrary permutation) 


n 


TT 

dn 


r 





(i) 


i=l 

are the ones producing outputs with least entropy. In the equation above, the maximally entangled state 
acts on the f-th copy of corresponding to non-conjugate channels <I>„ and on the 7r(i)-th copy of 
corresponding to conjugate channels The class of inputs among which the products of maximally 
entangled states are optimal are called “well-behaved”, in the sense that they obey a random-matrix 
eigenvalue statistics; see [FN14, eq. (43)] for more details. 

The result above shows that inequality (7.2) is tight, when restricting the minimum on the left hand 
side to the class of well-behaved input states; the general question is open for random quantum channels, 
see Question 9.7. Moreover, an important point raised in [FN14] is that the optimality of maximally 
entangled inputs extends to tensor products of channels. This result might be useful for analyzing 
regularized versions of the minimum output entropies, in relation to the classical capacity problem. 


7.3. Early results in relation to the violation of MOE, history and the state of the art. We 

present next a short history of the various counterexamples to the additivity question, discussing different 
values of the parameter p. 

In the range p > 1, the first counterexample was obtained by Werner and Holevo [WH02]: they 
have shown that the channel (1 — x)id -|- xtransp, acting on for x = —l/{d — 1 ), violates the 
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additivity of the p-Renyi entropy for all p > 4.79. Then, Hayden and Winter proved, in their seminal 
work [HW08], that random quantum channels violate additivity with large probability, for all p > 1. The 
same result, using this time free probability, was obtained in [CNllb], with smaller system dimensions. 
Also in the range p > 1, Aubrun, Szarek and Werner proved violations of random channels, using this 
time Dvoretzky’s theorem [ASWIO]. For p close to 0, violations of additivity were proved in [CHL'*’08]. 

The most important case, p = 1, turned out to be much more difficult. The difficulty comes from the 
fact that one needs a precise control of the entire output spectrum, while for p > 1 controlling the largest 
eigenvalue turned out to be sufficient. The breakthrough was achieved by Hastings in [Has09], where 
he showed that random mixed unitary channels violate additivity of the von Neumann entropy. Several 
authors, using similar techniques as Hastings, improved, generalized, and extended his result [FKMIO, 
BHIO, FKIO]. An improved version of Dvoretzky’s theorem was used in [ASWll] to show violations at 
p = 1. Later, Fukuda provided a simpler proof of violation [Fukl4], using this time e-net arguments 
and Levy’s lemma, the techniques used also in the pioneering work [HLW06]. In [BCN12, BCN13], the 
authors use free probability theory to compute exactly the minimum output entropy of a random quantum 
channel [BCN13, Theorem 5.2]. These results lead to the largest value of the violation known to date (1 
bit), and the smallest output dimension [k = 183), see Theorem 7.4 below. 

Finally, let us mention that the majority of the violation results above use random constructions. The 
exceptions are the results in [WH02] (p > 4.79) and [GHPIO] (p > 2, using the antisymmetric subspace); 
the question of finding other explicit counterexamples is open to this day, see Question 9.6. 

We state next the best result to date concerning violations of additivity for the minimum output 
entropy [BCN13, Theorem 6.3]. 

Theorem 7.4. Consider a sequence ■ 7Wp;i,„j(C) —)• Alfc(C) of random quantum channels, obtained 
from random isometries 

Vn : ^ (8)C”. 

For any output dimension k > 183, in the limit n —)• oo, there exist values of the parameter t G (0,1) such 
that almost all random quantum channels violate the additivity of the von Neumann minimum output 
entropy. For any e, there are large enough values of k such that the violation can be made larger than 1 
bit. 

Moreover, in the same asymptotic regime, for all k < 183, the von Neumann entropy of the output 
state (g) $n](ll[tA:nJ) almost surely larger than 277™™(<h„). Hence, in this case, one can not exhibit 
violations of the additivity using the Bell state (2.2) as an input for the product of conjugate random 
quantum channels. 

The above theorem leaves open the maximal possible value of the violation for conjugate random 
quantum channels, due to the fact that the maximally entangled state is not known to be optimal in this 
scenario, see Question 9.7. 

8 . Other applications of RMT to quantum spin chains volume laws 

8.1. Maximum entropy principle for random matrix product states. Random matrix techniques 
play other roles in quantum spin chain theory. In this section we follow [CGGPG13]. 

In the theory of quantum spin chains, it is nowadays widely well justified, both numerically [Whi92] 
and analytically [Has07], that ground states can be represented by the set of Matrix Product States with 
polynomial bond dimension. In the situation of a chain with boundary effects in exponentially small 
regions of size b at both ends, homogeneity in the bulk and experimental access to an exponentially small 
central region of size 1. Tracing out the boundary terms leads to a bulk state given by 
d 

P= tr{LAi,^^^---Ai^_,^RA*^_^---A*^^^)\ib+i...iN-b){jb+i-jN-b\, (8.1) 

L+lv*iV-6 = l 

where all Aj, L > 0 and 7? > 0 are D x D matrices with D = poly(A^). 

In other words, the prior information can be understood as restricting the bulk-states of our system 
as having the form (8.1). 
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It is known from the general theory of MPS [PGVWC07] that this set has a natural (over)para- 
metrization by the group U{dD), via the map U ^ Ai = (0|[/|z). In U{dD), one can use the symmetry- 
based assignment of prior distributions to sample from the Haar measure. Similarly, the fact that the 
map X I—)■ Y^-AiXA* is trace-preserving leads to consider tr{R) = I, ||L||oo < Ij and gives natural ways 
of sampling also the boundary conditions (see below). Looking for the generic reduced density matrix 
Pi of I N sites then becomes a natural problem. It corresponds to asking about generic observations 
of ID quantum systems. This idea has been already exploited for the non-translational invariant case in 
[GdOZIO]. The aim of the present work is to show that pi has generically maximum entropy: 

Theorem 8.1. Let pi he taken at random from the ensemble introduced with D > . Then Wpi/Tipi — 

l/d^||oo < {d!' — except with probability exponentially small in D. 

Note that, since the accessible region I is exponentially smaller than the system size, the bound can 
be made arbitrary small while keeping the size of the matrices D polynomial in the system size. 

To prove the theorem, one needs the graphical Weingarten calculus provided in [GNIOb] (see Sections 
2.4 and 3.3) and an uniform estimate of the Weingarten function, more subtle than the one stated in 
Theorem 3.6. 

Finally, in the same context of condensed matter physics, let us mention the work of Edelman and 
Movassagh, containing applications of random matrix theory and free probability to the study of the 
eigenvalue distribution of quantum many body systems having generic interactions [MEIl]. 


8.2. Multiplicative bounds for random quantum channels. Once the additivity questions for the 
minimum p-Renyi entropy of random quantum channels had been settled in [HW08] and [Has09], the 
attention shifted towards the amount of the possible violations of the minimum output entropy. In 
[MonI3], Montanaro shows that random quantum channels are not very far from being additive by 
bounding the minimum output oo-Renyi entropy of a tensor power of a channel by the same quantity 
for one copy of the channel. His idea is to bound the desired entropy by a additive quantity, the norm 
of the partial transposition of the projection on the image subspace of the random isometry defining the 
channel. The following theorem is a restatement of [Monl3, Theorem 3]. 

Theorem 8.2. Let <I> : Xid{^) —^ A4fc(C) be a random quantum channel having ancilla dimension n. 
Suppose k < n, min{(i,/c} > 2(log2n)^/^ and d = o{kn). Then, for any p > 1, with high probability as 
n —> oo, the following inequality holds iLI™™(4>®’') > /3(1 — l/p)iL™“(4>) where 


(3 ~ 



if d>n/k 
if d <n/k 


Soon after, Montanaro’s ideas were pursued in [FN15]. There, different additive quantities (e.g. the 
operator norm of the partial transpose of the Choi matrix of the quantum channel) were used to bound 
the minimum output 2-Renyi entropy. The results provide slight improvements, in the case of interest 
p = 1 over the bounds from [Monl3]. The following statement follows from [FN15, Theorem 8.4]. 


Theorem 8.3. Consider a sequence of random quantum channels : A4d{C) —)> A4fc(C) with ancilla 
dimension n, where k is a fixed parameter and d ~ tnk for a fixed t G (0,1). Then, almost surely as 
n —>■ oo, for all p G [0, 2], there exist constants ap G [0,1] such that, for all r >1, 

> apH^^{^^). ( 8 . 2 ) 

The constants ap satisfy the following relations 
(1) When 0 < t < 1/2 is a constant. 


Op = o(l) 


p — 1 


^ 21og2-Hog(l -t) 
logt 


1(1,2] (P)- 


2p 
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(2) When k is large and t k with t > 0, 

= o(l) + ^ 


' p—1 

t/2 

1 


ifO<T<l — l/p 
ifl-l/p<T<2 
i/r > 2. 


Incidentally, since the limiting spectrum of the partial transposition of the Choi matrix is computed 
in [FN15], the authors show the existence of PPT quantum channels violating generically the additivity 
of the minimum p-Renyi entropy, for all p > 30.95, see [FN15, Theorem 10.5]. 


8.3. Sum of random projections on tensor products. Ambainis, Harrow and Hastings [AHH12] 
consider a problem in random matrix theory that is inspired by quantum information theory: determining 
the largest eigenvalue of a sum of p random product states in (C'^)(8>A: where k and p/d^ are hxed 
while d —>■ oo. When k = 1, the Marcenko-Pastur law determines asymptotically the largest eigenvalue 
(1 + \/p/d^)‘^, the smallest eigenvalue, and the spectral density. 

More precisely, their setup is as follows: for each dimension d, let be independent uni¬ 

formly distributed rank one random projections on C'^. 


Theorem 8.4. As d ^ oo, the operator norm of 




still behaves almost surely like (1 -|- -s/p/d^Y the spectral density approaches that of the Marcenko- 
Pastur law (3.1). 

Their proof is essentially based on moment methods. Direct computation of moments of high order 
allow to conclude. Various methods are proposed by the author, including methods of Schwinger-Dyson 
type. It would be interesting to see whether these methods that are well established in theoretical physics 
and random matrix theory could be of further use in quantum information theory. This result generalizes 
the random matrix theory result to the random tensor case, and for the records, this is arguably one of 
the first precise results about the convergence of norms of sums of tensor products when the dimensions 
of each legs are the same. 

The original motivation of the authors emanates in part from problems related to quantum data-hiding. 
We refer to [AHH12] for the proofs and motivations. 


8.4. Area laws for random quantum states associated to graphs. We would like to generalize 
now Proposition 4.12 to the general case of non-adapted marginals. The theorem in this section makes 
use of random matrix theory techniques, more precisely is build on the moment computation done in 
[CNZIO, Theorem 5.4]. 

Before we state the area law, we need to properly define the boundary of a the marginal induced by a 
partition {5, T} of the total Hilbert space. In the adapted marginal case discussed in Section 4.1.4, this 
definition was natural; the general situation described here requires a preliminary optimization procedure. 

To keep things simple, assume that all local Hilbert spaces have the same dimension N. A partition 
{5, T} defines, at each vertex of the graph, a pair of non-negative integers {s{v),t{v)) such that s{v) -\- 
t{v) = deg(u) and s(v) = [S'!, ~ 1^1- randomness in the unitary operators Uy acting on 

the vertices of G introduces an “incertitude” on the choice of the copies of which should be traced 
out at each vertex v € G. The following definition of the boundary volume removes this incertitude by 
performing an optimization over all possible choices for the partial trace. Note that the case of adapted 
marginals (see Definition 4.11) does not require this optimization step, since there is no incertitude (at 
each vertex, either all or none of the subsystems are traced out). 

Definition 8.5. For a graph G and a marginal ps of the graphs state pc defined by a partition {S', T}, 
define the boundary volume of the partition as 

|(95| = maxcr(Q;), 

a 
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where a is a function a : [2m] —>■ {5, T} defining which copies of C"' are traced out, and cr(a) is the 
number of crossings in the assignment a, that is the number of edges in G having one vertex in Q!“^(S') 
and the other one in a~^{T). 

The following theorem is the main result of [CNZ13], showing that the area law holds for random 
graph states, with the appropriate definition of the boundary volume. Moreover, one can compute the 
correction term to the area law, a quantity which depends on the topology of the graph G. We refer the 
interested reader to [CNZ13, Sections 5,6] for the definition of the correction term hc^s ^md the proofs. 

Theorem 8.6. Let ps be the marginal {5", T} of a graph state ipc- Then, as N ^ oo, the area law holds, 
in the following sense 

EH{ps) = 1951 log N - hr,s + o{l), (8.3) 

where |95| is the area of the boundary of the partition {5, T} from Definition 8.5 and hr,s is a positive 
constant, depending on the topology of the graph G and on the partition {5, T} (and independent of N). 


9. Conclusions and open questions 

We finish this review article with a series of questions that seem to be of interest at the intersection of 
random matrix related techniques, and quantum information / quantum mechanics. 

In relation to the various threshold result from Section 4.4, we list next several important open ques¬ 
tions. 


Question 9.1. Is it possible to remove the log factors from Theorem f.l7 and to obtain a sharper 
threshold result for the set SEV of separable states? 

Regarding the hierarchy of r-extendibility criteria, the upper bound corresponding to the threshold 
result in 4.24 is open, see [Lanl5, Section 9.2]. 

Question 9.2. Find a constant ci > (r — l)^/(4r) such that random quantum states having distribution 
^^2 are, with high probability as n ^ oo, r-extendible. 

Regarding the random entanglement criteria introduced in Theorem 4.21, one can define 

= {yO e T)nm '■ [fd ® idm](p) > 0 almost surely, for d large enough}. 

The following question, addressing the global power of such random criteria, was left open in [CHN15]. 

Question 9.3. Define the set of quantum states satisfying all random criteria from Theorem 4-21 

r\ ^ 

^n,k,m I I 

fi: supp(/2®’*/'')c[0,oo) 

Can one give an analytical description of It was shown in [CHN15, Proposition 3.7] that the 

only pure states contained in separable (product) ones. Are there values of the parameters 

n, k, m for which the set is precisely the set of k-separable states from Vnm ? 

In Section 8.1, we have discussed a random model for matrix product states, and we have shown it 
obeys the maximum entropy principle of Jaynes. There are also natural questions related to quantum 
spin chains: 


Question 9.4. In Section 8 . 4 , we stated an area law for random quantum states. Given a random 
Hamiltonian acting on , let 77^*^ be the operator obtained from Hn acting on by the 

action of H]\f on the ith leg, and identity elsewhere. We assume that we come up with a model with a 
gap, i.e. the difference between its smallest eigenvalue and its second smallest eigenvalue is uniform. It 
follows from results by Hastings [Has07] that the ground state of the Hamiltonian Y) H^ satisfies an area 
law. If H]\f has some randomness in addition, can we obtain more precise results, e.g. regarding the 
distribution of the ground state? In the same vein, can random techniques allow us to obtain results for 
other topologies, e.g. in the 2D context? 
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Let US now consider some open questions in quantum information theory, related to random matrices. 
As discussed in Section 4.1, there are several ways in which one can define random quantum states. All 
classes of probability measures discussed in Section 4.1 are very well motivated, both from the mathemat¬ 
ical and the physical standpoints. In [NP12], the authors introduce a new ensemble of random quantum 
states, by considering iterations of random quantum channels. The following question was asked in [NP12, 
Section 4]. 

Question 9.5. Compute the statistics of the probability measure Vb on the set of quantum states Vn 
defined as follows. For a probability vector b G A^, consider the quantum channel 

4>(A) = [id„ ® Tik]{U{X ® diag(6))17*), 

where U G Unk is a random Haar unitary. Then, Vb Is the probability distribution of the unique invariant 
state of ^ (uniqueness is shown in [NP12, Theorem 4.4]. 

Regarding the various counterexamples in the literature for the minimum output entropy and other 
capacity-related questions, we list next several open problems. 

Question 9.6. Construct explicit, non-random counterexamples to the additivity of the p-Renyi entropy, 
in the range p G [1, 2]. 

Question 9.7. Is the maximally entangled state the actual minimizer of the minimum output entropy 
for a pair of conjugate random quantum channels (8) $ ? 

Question 9.8. Does a pair <I> (8) T o/ independent random quantum channels violate additivity of the 
quantities ? 

Regarding the known violations of the additivity of the MOE entropy for pairs of conjugate channels, 
it is important to note that Theorem 7.4 only allows to obtain bounds on the output dimension of the 
random channels. Previous results (see, e.g. [FKMIO]) allow to bound also the input dimension. The 
approach used in [BCN12, BCN13], using free probability, uses estimates of objects existing at the limit 
where the input dimension is infinity. It would thus be desirable, in this framweork, to be able to work at 
finite input dimension, and thus bound all the relevant parameters which allow for additivity violations. 

Question 9.9. A random contraction is known to he determinantal [Metl3] and the determinant involves 
contour integrals. So far, many random matrix techniques used for QIT rely either on concentration 
of measure, or on moment methods. Is it possible to use complex analysis methods (steepest descent, 
Riemann-Hilbert problem analysis) in order to refine existing estimates. For example, can such estimates 
give bounds for dimensions of input spaces for violation of MOE? 

Finally, we would like to end the current review with a very important open question, regarding 
different regularized quantities for random quantum channels. 

Question 9.10. Compute the almost sure limit of the regularized quantities, the Holevo capacity 

and the classical capacity for random quantum channels. 
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